Automatic derivation of multiple variants of phonetic transcriptions from acoustic signals

نویسندگان

  • Houda Mokbel
  • Denis Jouvet
چکیده

This paper deals with two methods for automatically finding multiple phonetic transcriptions of words, given sample utterances of the words and an inventory of context-dependent subword units. The two approaches investigated are based on an analysis of theN -best phonetic decoding of the available utterances. In the set of transcriptions resulting from theN -best decoding of all the utterances, the first method selects theK most frequent variants (Frequency Criterion) , while the second method selects the K most likely ones (Maximum Likelihood Criterion). Experiments carried out on speaker-independent recognition showed that the performance obtained with the ”Maximum Likelihood Criterion” is not much different from that obtained with manual transcriptions. In the case of speaker-dependent speech recognition, the estimate of the 3 most likely transcription variants of each word, yields promising results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Acoustic reduction in conversational Dutch: A quantitative analysis based on automatically generated segmental transcriptions

In spontaneous, conversational speech, words are often reduced compared to their citation forms, such that a word like yesterday may sound like [’jESeI]. The present paper investigates such acoustic reduction . The study of reduction needs large corpora that are transcribed phonetically. The first part of this paper describes an automatic transcription procedure used to obtain such a large phon...

متن کامل

Automatic phonetic transcription of large speech corpora

This study is aimed at investigating whether automatic phonetic transcription procedures can approximate manual transcriptions typically delivered with contemporary large speech corpora. To this end, ten automatic procedures were used to generate a broad phonetic transcription of well-prepared speech (read-aloud texts) and spontaneous speech (telephone dialogues) from the Spoken Dutch Corpus. T...

متن کامل

Towards an Optimal Phonetic Representation of the Acoustic Signal as Starting Point for Search in Hsr Models

The research presented in this paper is concerned with the development of an automatic phone recogniser (APR) that converts an acoustic speech signal into a phone string, that meets the criterion that the number of phones should be ‘correct’. Two methods are investigated, viz. the optimisation of the APR on the ‘number of phones’ criterion and the introduction of variants in the phonetic transc...

متن کامل

Automatic Phonetic Transcription by Phonological Derivation

Automatic phonetic transcription tools usually perform phonetic transcriptions directly from orthographic representations. Although these approaches often achieve good results, theoretical studies suggest that including morphophonological knowledge allows those systems to improve their performance. Following this idea, we developed a tool which first obtains an underlying representation of each...

متن کامل

Phonetic labeling and segmentation of mixed-lingual prosody databases

An automatic system for segmenting speech signals used for the training of statistical prosody models is presented. Starting from a canonical transcription, the system simultaneously delivers an accurate phonetic segmentation and the matched phonetic transcription indicating pronunciation variants. Although the system is HMM-based, it uses only the speech signals of the prosody database which t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997